An efficient ensemble pruning algorithm using One-Path and Two-Trips searching approach
نویسنده
چکیده
Keywords: Ensemble pruning Backtracking algorithm Ensemble Pruning via BackTracking algorithm (EnPBT) One-Path and Two-Trips searching algorithm (OPTT) Pruning efficiency a b s t r a c t Ensemble pruning is substantial for the successful application of an ensemble system. A novel method for Ensemble Pruning via BackTracking algorithm (EnPBT) was proposed by us in our previous work. Back-tracking algorithm can systematically search for the solutions of a problem in a depth-first and jumping manner, suitable for solving all those large-scale combinatorial problems. The validity of EnPBT algorithm has been verified in our previous work. However, the relatively slow pruning speed might be a drawback of EnPBT. Aiming at this problem, an efficient and novel ensemble pruning algorithm is proposed in this paper, i.e. One-Path and Two-Trips (OPTT) ensemble pruning algorithm. It is very fast in pruning speed, while its classification performance has no significant difference with EnPBT, as demonstrated in the experimental results of this work. In short, OPTT achieves a proper trade-off between pruning effectiveness and efficiency. An ensemble refers to a set of component models whose predictions are integrated by simple unweighted voting or weighted voting [1]. Ensemble learning has attracted a lot of attentions from machine learning community since the last decade, owing to its potential in prediction accuracy increasing [2–7]. Remarkable improvement in generalization performance has been observed from ensemble learning in a broad scope of application fields, for example: face recognition [8], optical character recognition [9], scientific image analysis [10,11], medical diagnosis [12,13], financial time series prediction [10], military purposes [14], intrusion detection [15], etc. Despite its remarkable performance, a major drawback of ensemble learning is that, generally, it is necessary to combine a large number of classifiers to ensure that the error converges to its asymptotic value. This entails large memory requirement and slow speed of classification. These aspects can be critical in online applications [16–18]. A possible way to alleviate these shortcomings is the selection of a fraction of the classifiers from the original ensemble. Besides the reduction in complexity, ensemble pruning has other potential benefits. In particular, a subset of complementary classifiers can perform better than the complete ensemble [1,16,17,19–24]. However, it has been proven to be an NP-complete problem to realize ensemble pruning effectively and efficiently. Enumerative algorithm for searching the best subset of member networks is not easily worked for ensemble that contains a large number of constituent models. Greedy …
منابع مشابه
Reachability checking in complex and concurrent software systems using intelligent search methods
Software system verification is an efficient technique for ensuring the correctness of a software product, especially in safety-critical systems in which a small bug may have disastrous consequences. The goal of software verification is to ensure that the product fulfills the requirements. Studies show that the cost of finding and fixing errors in design time is less than finding and fixing the...
متن کاملA Heuristic Algorithm for Nonlinear Lexicography Goal Programming with an Efficient Initial Solution
In this paper, a heuristic algorithm is proposed in order to solve a nonlinear lexicography goal programming (NLGP) by using an efficient initial point. Some numerical experiments showed that the search quality by the proposed heuristic in a multiple objectives problem depends on the initial point features, so in the proposed approach the initial point is retrieved by Data Envelopment Analysis...
متن کاملOptimum Ensemble Classification for Fully Polarimetric SAR Data Using Global-Local Classification Approach
In this paper, a proposed ensemble classification for fully polarimetric synthetic aperture radar (PolSAR) data using a global-local classification approach is presented. In the first step, to perform the global classification, the training feature space is divided into a specified number of clusters. In the next step to carry out the local classification over each of these clusters, which cont...
متن کاملAn Effective Path-aware Approach for Keyword Search over Data Graphs
Abstract—Keyword Search is known as a user-friendly alternative for structured languages to retrieve information from graph-structured data. Efficient retrieving of relevant answers to a keyword query and effective ranking of these answers according to their relevance are two main challenges in the keyword search over graph-structured data. In this paper, a novel scoring function is proposed, w...
متن کاملA Preprocessing Technique to Investigate the Stability of Multi-Objective Heuristic Ensemble Classifiers
Background and Objectives: According to the random nature of heuristic algorithms, stability analysis of heuristic ensemble classifiers has particular importance. Methods: The novelty of this paper is using a statistical method consists of Plackett-Burman design, and Taguchi for the first time to specify not only important parameters, but also optimal levels for them. Minitab and Design Expert ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Knowl.-Based Syst.
دوره 51 شماره
صفحات -
تاریخ انتشار 2013